THE PROBABILITY OF THE ALABAMA PARADOX 



SVANTE JANSON AND SVANTE LINUSSON 

Abstract. Hamilton's method is a natural and common method to dis- 
tribute seats proportionally between states (or parties) in a parliament. 
In USA it has been abandoned due to some drawbacks, in particular the 
possibility of the Alabama paradox, but it is still in use in many other 
countries. In this paper we give, under certain assumptions, a closed 
formula for the asymptotic probability, as the number of seats tends to 
infinity, that the Alabama paradox occurs given the vector pi , . . . , pm 
of relative sizes of the states. 

From the theorem we deduce a number of consequences. For example 
it is shown that the expected number of states that will suffer from 
the Alabama paradox is asymptotically bounded above by 1/e and on 
average approximately 0.123. 



1. Introduction and main result 

Proportional representation is desired in various circumstances. One com- 
mon case is in elections in many countries, where each party is awarded a 
number of seats in parliament proportional to the number of votes. Since 
the number of seats has to be an integer, it becomes a mathematical problem 
to choose these integers in a way that approximates exact proportionality, 
and a number of different methods are in use. 

In the United States, elections are done differently (with single-member 
constituences) , but the same mathematical problem exists for apportion- 
ment to the House of Representatives. By the Constitution, each state has 
a number of representatives that is proportional to its population. However, 
the Constitution does not specify by which method the numbers are to be 
determined. (The numbers are determined by Congress every tenth year, 
after a new census.) Therefore, the choice of method has been subject to 
much debate since 1791, see Balinski and Young [1] for a detailed history. 
(In 1941, a specific method was chosen by law to be used not only for that 
apportionment but also for all coming ones. This has eliminated the need 
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for debates and new choices every ten years, so there is now much less de- 
bate. For the current method, see [1], which also discusses why the method 
is slightly biased and could be improved.) 

The problem was further complicated by the fact that the Constitution 
does not specify the total number of representatives. Thus, when discussing 
apportionment, the Congress discussed not only different methods but also 
different sizes of the House. During the second half of the 19th century, 
the favourite method was Hamilton's method (also called method of largest 
remainder), which can be described as follows. (The method was proposed 
by Alexander Hamilton in 1792 for the first apportionment; it was then 
approved by Congress but vetoed by president Washington. The method 
was proposed again by Samuel Vinton in 1850 when it became law and was 
used, with some fiddling, for the rest of the century; see [1] for details.) 

Hamilton's method. Suppose that there are m states with populations 
Pi, . . . ,Pm, cLnd n seats to distribute. Let P = Y^'^li Pi be the total popu- 
lation and let pi = Pi/P be the relative population of state i, i.e., its pro- 
portion of the total population. Calculate fii = pin; this is the real number 
that would give exact proportionality. First round these down and give [/.tjj 
seats to state i. The sum of these numbers is almost always less than n (the 
exception is when all /ij happen to be integers), and the remaining seats are 
given to the states with largest remainders — [^jj . In other words, fii is 
rounded up for the states with largest remainders, and the number of states 
that are rounded up is determined so that the total number of seats becomes 
n. 

The method is simple and intuitive, and it does not bias against small 
or large states. However, in 1881 it was discovered that this method has 
a surprising and unwelcome behavior when the total number of seats is 
changed: It can happen that some state gets less representatives when the 
total number is increased. More precicely, using the population figures from 
the 1880 census, a total of 299 seats would give 8 to Alabama, but a total 
number of 300 would give only 7 to Alabama, see [1, p. 39] for details. This 
counterintuitive behaviour got the name Alabama paradox, and it eventually 
led to the abolishment of Hamilton's method in favour of others that do not 
suffer from this defect. (The same problem was actually observed already 
1871, in that case for Rhode Island, but this went largely unnoticed [1, 
p. 38].) 

Remark 1.1. The method is still used in parliamentary elections in several 
countries, either to distribute seats among multi-member consituencies (e.g. 
Sweden), or to distribute seats among the parties (e.g. Denmark). There 
the number of seats is fixed in advance, so the Alabama paradox is not 
an obvious problem. However, the paradox may surface and give strange 
behaviour in combination with other rules, and in some election systems 
where the method is used a party that gains a vote can, in exceptional 
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Figure 1. The Alabama paradox. Numbers in boldface are 
rounded up. (Population figures may be in thousands or 
millions, for example.) 

situations, actually lose a seat in parliament, see [3]. Germany used ear- 
lier Hamilton's method (there called Hare-Niemeyer's method) for federal 
elections but changed in 2008 for this reason (although the problem partly 
remains for other reasons); the method is still used in several German states, 
however. 

The Alabama paradox is mathematically not strange, once it has been 
noticed: Consider three states A, B and C. If we increase n to n + 1, the 
number /ij = npi is increased by pi. If, for example, state C is small and 
states A and B larger, then fic increases less that and //b- Suppose for 
simplicity that none of these numbers passes an integer, so the integer parts 
remains the same for n and n + 1 for all three states; then their remain- 
ders Pi = fii — increase by pi, and it may happen that the remainders 
PA and pb both are smaller than pc when we distribute n seats, but that 
both become larger than pc when we increase n to n + 1 . If furthermore C 
had the smallest remainder that was rounded up, then the result is that C 
loses one seat while A and B gain one each. (We assume that no other state 
interferes.) A simple numerical example is given in Figure 1. 

So the Alabama paradox certainly may happen, and it has occurred, but 
how likely is it? 

Of course, the description so far is purely deterministic (except for the 
necessity to draw lots sometimes when there is a tie); the Alabama paradox 
either occurs or not for given parameters. Let us, however, assume that the 
population sizes are given, but choose a random number n of seats. What is 
the probability that the Alabama paradox occurs if n is increased to n+ 1? 

By choosing a random number n we mean choosing n uniformly at ran- 
dom from {1,2,..., N} for some large integer A^, and then taking the limit 
(assuming that it exists) as — t- oo. Thus, more formally, let Si{n) be the 
number of seats state i receives when n seats are distributed. Increase the 
number of seats n, by one seat at a time, from 1 to N, and let Vi{N) be 
the number of times that state i suffers from the Alabama paradox, i.e., 
the number of n < such that Si{n + 1) < Si{n). If Vi{N)/N converges 
to some value qi as A^ — t- oo, we say that the limit qi is the probability that 
state i suffers from the Alabama paradox. (This approach, to consider given 
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sizes but a random number of seats, is used also in [5] where some other 
properties of election methods are studied.) 

In order to calculate this limit (and show that it exists), we will make one 
mathematical simplification. Recall that a set {xi, . . . ,Xk} of real numbers 
is linearly independent over Q if there is no relation aixi + • • • + akXk = 
with all coefficients Oj rational and not all 0. (Equivalently, there is no such 
relation with integer coefficients aj, not all 0.) We will assume that the 
relative population sizes are linearly independent over Q. 

Remark 1.2. Mathematically, this assumption is reasonable, since if we 
choose pi, . . . ,Pm at random (uniformly given that their sum is 1), they will 
almost surely be linearly independent over Q. However, for the practical 
problem of apportionment, the assumption is clearly unreasonable since the 
populations Pi are integers and the pi thus rational numbers. Nevertheless, 
the formula below is a good approximation if the numbers pi have large 
denominators and there are no relations 

aipi H h amPm = (1.1) 

with small integers oi, . . . , Um- More precisely, it will be shown in Section 7.2 
that for any e > 0, there is an ^ = A{m, e) such that the value qi in 
(1.2) or (1.3) differs from the exact probability by less than e for every 
distribution (pi)™ for which there is no such relation (1.1) with integers a, 
and Yli l^-il ^ we omit the details. 

We leave it as an open problem to extend the result below and find ex- 
act formulas for all pi, . . . ,pm, and in particular for rational pi, ■ ■ ■ ,Pm- (If 
pi, . . . ,pm are rational, then the sequence Si{n + 1) — Si{n) will be periodic, 
so the limit qi certainly exists; the existence in general is shown in Sec- 
tion 7.2.) Note that some modifications are required for rational pi, . . . ,pm- 
For example with three states and pi = P2 = "^f^, ps = 1/5, it is easily seen 
that the Alabama paradox never occurs, so the probability is for all three 
states. See also Proposition 7.4, were it is shown that the expected number 
of states suffering from the paradox could be arbitrarily close to 1. 

We use the standard notations := max{x,0} and x_ := (— x)+ = 
— min{a;,0}, noting x = x+ — X- and |rE| = x+ + X-. Let ek{xi, . . . ,Xn) 
denote the elementary symmetric polynomial of degree k m n variables, 
i.e. ek{xi,...,Xn) := J2i<ii<-<i^:<nl\'j=i^ir With Be(p) we mean the 
Bernoulli distribution; thus / ~ Be(p) if P(/ = 0) = 1 — p and P(/ = 1) = p. 
We let p(i) > • • • > be the population vector pi, . . . ,Pm rearranged in 
increasing order, and let . . . , be the corresponding probabilities of 
the Alabama paradox, which by Corollary 2.3 below is the vector qi, . . . ,qm 
rearranged in increasing order. For clarity, we will use the notation p(j) and 
whenever we consider the states in increasing order, and pi, qi only when 
the order is irrelevant. 

Theorem 1.3. Suppose that m states have relative sizes pi, . . . ,pm, with 
YliLiPi = Ij o,iT-d assume that pi, ■ ■ ■ ,Pm o,re linearly independent over Q. 
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Then the probability qi that state i suffers from the Alabama paradox when 
we increase the total number of seats by one equals 

^-=^IE(5r-5+-l)^, (1.2) 

where Sf = Y.r.p^<pjf and = J2j:p^>pjf with if ~ Be{\pi - pj\) 
and Ij^,..., Im independent. More explicitly, if the states are ordered with 
P{1) > • • • > P{m)> ^^^-5 can be written 

s=0 k=2 ^ ^ 

(1.3) 

where rf := 

In other words, each /j*^ € {0, 1} with P(/j*^ = 1) = |pj — Pj\- 

Remark 1.4. If we do not order the states, then equation (1.3) can equiv- 
alently be written as 

m-3m-s-l / I i, o\ 

«^ = ^E E (-ir'=p^>4e--->e)e.Ml...,e), 

s=0 fc=2 ^ ^ 

(1.4) 

where rj^ := {Pi — Pj)±- Since (1.4) is symmetric under permutations of 

(i) 

Pi, ■ ■ ■ ,Pm, we may assume > • • • > P[m)- this case, r^^ = for j < z 

and rj*2 = for j > i, and it is easily seen that the sums in (1.4) and (1.3) 
are equal. 

The proof of the theorem is given in Section 4. We give first several 
consequences of the main theorem in Sections 2-3; the proofs of these results 
are given in Sections 5-6. 

For simplicity, we have here considered one state at a time. It may happen 
that the Alabama paradox occurs for two (or more) states at the same time, 
although this is less likely, see Section 7.1. 

2. Further results 

In the case of three states. Theorem 1.3 yields the following simple for- 
mula. 

Corollary 2.1. Suppose that there are three states with relative sizes p^i^ > 
P{2) > P{3) with + P(2) + P{3) = 1; ^nd assume that P{i),P(2),P{3) are 
linearly independent over Q. Then only the smallest state can suffer from 
the Alabama paradox, and the probability of this is 5(^(1) ~ P{3)){P{2) ~P{3))- 

The supremum of this probability over all distributions {p{i),P(2),P(3)) is 
1/12, which is never attained but is approached in the extreme case when 
P(3) is very small and and p(2) both are close to 1/2. 
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In general, the Alabama paradox can affect any state except the two 
largest, but it is much more likely to affect small states. 

Remark 2.2. Note however that Hamilton's method is unbiased. On the 
average, state i increases its number of seats by pi each time n is increased, 
so if it sometimes suffers from the Alabama paradox and its representation 
decreases with frequency qi, this must be compensated by a frequency + 
of the times when the number of seats increases. 

Corollary 2.3. In addition to the assumptions of Theorem 1.3, assume 

that > • • • > p(rn)- Then q^^n) > 9{m-i) > ' ' ' > 9(3) > Q{2) = 9(1) = 0. 
Moreover, the largest probability is 

^ m—l 

= IK^ - ^Pu) -pm)) -Pir^^y (2-1) 

We have, for any i, the inequalities 

1 [e-i - mp(,) -^Yl P^) ^ m < - (l ^) < . (2.2) 

m\ ^' 2^-^ ■'J ^' m\ m — l/ m 

j 

If P(m) ~^ s-i^cl all other — > l/(m — 1), then (2.1) shows that 

9(m) ~^ m(l ~ m^)'" ^' ®° ^^^^ ^ given m, the least upper bound on 

the probability of the Alabama paradox for a specific state in the linearly 
independent case (but only in that case, see Section 7.2), which for large m 
approaches 1/me. 

Corollary 2.4. Under the assumptions of Theorem 1.3, the expected number 
of states suffering from the Alabama paradox each time the number of seats 
is increased is less than 1/e. This bound is approached if we let m oo and 
suppose that m — o{m) of the states are very small, with pi = o(l/m), and 
that the remaining states are medium-size with pi = o(l). 

In this extremal case, the paradox is thus very common. It can be even 
more common in the rational case, see Proposition 7.4. See also Exam- 
ple 7.2, where we show that the probability of at least one state suffering 
the Alabama paradox in this case converges to 1 — 2/e ~ 0.264. We con- 
jecture that this is the upper bound of the probability that at least one 
state suffers the paradox (under the assumption of Theorem 1.3); note that 
Corollary 2.4 shows that the probability is always less than e~^ ~ 0.368. 

The exact formula in (1.3) is unwieldy when m is large; it may then be 
attractive to use (1.2) with a Poisson approximation of 5"^^: 

Corollary 2.5. Under the assumptions of Theorem 1.3, let (for a given i) 
:= T.j:p^<pMi ~Pi^ "■'^^ '■= ^j:pj>pM - Pi)' Let further S+ ~ 
Po(A'^) and 5~ ~ Po(A~) be independent Poisson random variables, and 
define 

qi:=-E{S- -S+ -1), (2.3) 
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Then 



= i 5^0-_._l)(^4p!,---- (2.4) 

m ^ ^ ^ ]\k\ ^ ^ 

j>k+2 ■' 

(2.5, 

j<k 

^ m 

h-Qil < —^{Pj -Pi)^- (2.6) 

3 = 1 

3. Average probability of the Alabama paradox 

We have so far considered the probabihty of the Alabama paradox for 
given relative population sizes , . . . , . Let us now instead fix m > 3 
and consider the average probability over all population distributions. In 
other words, in this section we let (pi, . . . ,Pm) be random and uniformly 
distributed over the simplex A*" := {{pi, ■ ■ ■ ,Pm) £ [0, 1]"^ : X^iK = 1}) ™d 
take the expectation. Note that then pi, . . . ,Pm sue linearly independent 
over Q a.s., so we may in the sequel assume that Theorem 1.3 and its 
corollaries apply. 

As above, > ••• > denotes the population vector pi,...,pm 

rearranged in increasing order; note that (p(i), . . . ,P(m)) is uniformly dis- 
tributed over the subset > • • • > P(m) of the simplex A*". Similarly, 
Q{i) < ■ ■ ■ < Q{m) &re the corresponding probabilities of the Alabama para- 
dox, which by Corollary 2.3 are qi, . . . ,qm rearranged in increasing order. In 
particular, g'(^) is the probability of the Alabama paradox for the smallest 
state. Note that = (7(2) = but a.s. (/(fc) > for /c > 3. Since Qi and 
are functions of pi, . . . ,Pm, they too now are random variables, and we may 
ask for their expectations, or other properties of their distributions. We use 

the notations and — ^ for convergence in distribution and probability, 
respectively, always as m — > 00. 

In the case m = 3, the average of the formula in Corollary 2.1 over all 
(P(i) 5^(2) 5^(3)) is easily found by integration, for example by the substitution 
= 1 — P(3) ~P{2) and integrating over (p(2))P(3)) with the conditions < 
P(3) < I and p(3) < p(2) < (1 ~P(3))/2; a calculation yields the probability 
Eg(3) for the Alabama paradox for three states of random sizes as 1/36. 

We extend this to larger m. First we consider only the smallest state, 
which is most likely to experience the paradox. Recall the notation m'' := 
m{m + 1) . . . (m + k — 1) for the rising factorial. 

Theorem 3.1. With uniformly random relative population sizes, the ex- 
pected probability E g^^) that the smallest state among m states will suffer 
from the Alabama paradox is 

= - E(-i)'H^ --2 = - E(-i)'H^ (3-1) 

k=o '"' k=2 '"' 
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Hence, as m ^ oo, 



E. 



(m) 



m 



1 

■ oo. 



+ 



(3.2) 



Furthermore, mq(j^-^ — ^ e ^ as m 

Hence, for large m, the expected value of the largest probability g(^) is 
asymptotically equal to the upper bound e~^/m given in Corollary 2.3, and, 
furthermore, q(jn) is close to /m for most pi, . . . 

Remark 3.2. The sum in (3.1) is a hypergeometric sum and the result can 
be written 

1 „ . .1 



(m) 



m 



iFi(l — m; m; 1) 



(3.3) 



with a confluent hypergeometric function iFi (in this case a polynomial). 
For small m we have the following table. 



m 


2 


3 


4 


5 


6 


7 


8 


9 







1 


17 


61 


907 


153709 


855383 


134964353 


36 


480 


1680 


25920 


4656960 


27675648 


4670265600 



For a fixed (or random) state, we obtain a more complicated formula. 

Theorem 3.3. With uniformly random relative population sizes the aver- 
age probability E qm that a given state among m states will suffer from the 
Alabama paradox is 



m—3m—s—l s 



E, 



-J III, — 'J III, — J — 



m 



=0 fe=2 1=0 j=0 

(m- 1)!^ 



s + k-2 
s 



k\il{s — i — jy. (m — 1 — k — sy. {m 
For small m we have the following table. 



i + k + sy. 



{i + k + i)~^-^. 



m 


2 


3 


4 


5 


6 


7 


8 


Egm 





1 


17 


523 


2287039 


100704757 


404675341849 


108 


1440 


43200 


195955200 


9144576000 


39230231040000 



The average probability Eg^ in Theorem 3.3 is, of course, always smaller 
than the average of the largest probability Eg(^) in Theorem 3.1. It is 
somewhat surprising that it is not much smaller, the ratio is close to 1/3, 
as is shown for some small m by the following table (with rounded values 
computed by Maple): 



m 


3 


10 


20 


30 


50 


100 


¥.qrn/^q{m) 


0.33333 


0.33392 


0.33441 


0.33457 


0.33474 


0.33487 



Indeed, this persists for large m, and E^m, is really of order 1/m, since 
Theorem 3.3 implies the following asymptotic formula by dominated con- 
vergence: 
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Corollary 3.4. With uniformly random relative population sizes, the ex- 
pected number of occurrences of the Alabama paradox among all m states is 
m E Qm which as m ^ oo has the limit 



s=0 k=2 i=0 j=0 ^ ^ 



kill (s — i — jy. 



Thus, E 

Qm ^ b/m. We do not know any better closed form for 6, but 
numerically we obtain from Maple b ~ 0.12324 and thus, using Theorem 3.1, 
Eg^/Eg(^) be 0.33501, in accordance with the table above. (If Maple 
is correct, the limit is not exactly 1/3, but quite close.) 

The formula for b in Corollary 3.4 as an alternating quadruple sum is 
not very illuminating, and it is not even easy to see that 6 > from it, but 
that at least follows from the alternative representation in the next theorem, 
which adds more information on the asymptotic relation between size and 
probability for the Alabama paradox for random populations. 

Theorem 3.5. Define, for any A~, A"^ > 0, with ~ Po(A^) independent, 
similarly to (2.3) -(2.5), 

$(A-,A+) :=E(5~-S+-l)^ (3.4) 
-E«--)^^— (3.5) 

j>k+2 



,(A-_y(A+)\_,-_,+ 
j\k\ 



+ ' e-^-^" + A- - A+ - 1, (3.6) 



and'^{x) := <^(e ^ — l + rr). Then, as m ^ oo, with random population 
sizes, 

{m.prn,mqm)^ {T,-^{T)), (3.7) 
where T ^ Exp(l). In particular, 
mEg„ b = E^(r) 



j>k+2 



(3.8) 



By Theorem 3.5 and its proof, we can further say that, assuming random, 
uniformly distributed populations, a state with pi = x/m has probability 

?a ^(x)/m of suffering from the Alabama paradox. Note that the extreme 
case in Theorem 3.1 can be seen (informally) as the limiting case x = 0, since 

we have mp{jn) -^-^ and mq^j^-^ — )• e~^, and indeed ^(0) = <I>(1,0) = e~^, 
e.g. by (3.6). 

We plot the function ^ in Figure 2. 
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Figure 2. The function ^ in Theorem 3.5. 

4. Proof of Theorem 1.3 

We analyse the process of successive distributions of seats as fohows. 

Think of the different states as m runners on a circular track, with state 
i running at constant speed (laps/time unit). At time n, state i has run a 
distance npj, and thus [npij full laps, so the number of seats that it gets is 
the number of completed laps, plus an additional seat for each of the states 
that have come furthest on the next lap; the number of these additional 
seats is chosen such that the total number of seats is n. 

We reformulate this by moving the finishing line; we mark its position 
by a flag and count laps as runners pass the flag. We place the flag by 
the runner that got the last additional seat, i.e. the state with the smallest 
remainder that is rounded up. Then the number of seats a state gets equals 
its number of laps, for every state. (The assumption that pi,...,pm. are 
linearly independent over Q implies that ties cannot occur, so we do not 
have to worry at all about ties in this proof.) 

Let us increase the total number of seats from n to n + 1 in two steps. 
We first increase time from n to n + 1 continuously, letting the runners run, 
but at the same time we also move the flag, by letting it be carried by a 
runner, so that the total number of laps stays at n. This means that if the 
runner carrying the flag overtakes another, slower, runner, then the flag is 
passed to the slower runner and both runners keep the same numbers of 
laps. On the other hand, if the runner carrying the flag is overtaken by a 
faster runner, then the flag is passed to the faster runner, who gets one lap 
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more, while the former flag-holder loses one lap. (Other overtakings do not 
affect the flag, nor the number of laps for anyone.) 

Finally, at time n + 1 we increase the total number of seats by one; this 
means that the runner carrying the flag throws it to the next runner behind 
him (her), who gains another lap. 

We count positions, at any given time, relative to the flag and say that 
position is the runner carrying the flag, positions —1, —2, . . . are the run- 
ners behind the flag-carrying runner, and positions 1,2,3,... the runners in 
front of him/her. (Since the track is circular, position k and position k — m 
are the same, but that does not matter as long as we take a little care.) It is 
easy to see that when one runner overtakes another, their positions (which 
differ by 1) are exchanged, while all other positions remain the same; this 
hold also if one of them carries the flag. 

Consider a specific runner, say runner i. The position of i increases by 
1 each time i overtakes someone else, and it decreases by 1 each time i is 
overtaken. Furthermore, it increases by 1 at the final step when the flag 
is thrown. Thus, if is the number of runners overtaken by i, and S~ 
the number of runners overtaking i, during the interval [n,?i + 1], then the 
position is increased by S*"*" — S~ + 1. Since the number of laps is changed 
only when the position changes between and —1, we see that: 

(+) State i gains a seat if — + 1 > and runner i has at time n 
one of the positions —1, —2, . . . , —{S^ — S~ + 1). 

(— ) State i loses a seat if — S~ + 1 < and runner i has at time n 
one of the positions 0, 1, ... , \S'^ — S~ + 1\ — 1. 

Case (— ) is thus when the Alabama paradox occurs for state i. Let L be 
the position of runner i relative to the flag at time n, normalized to have 
L G {0, . . . , m — 1}. Then the Alabama paradox occurs if and only if 

-S- + l< -L. (4.1) 

Let the indicator Ij' be 1 if i overtakes j during [n, n + 1], and otherwise; 
similarly, let IJ' be 1 if i is overtaken by j and otherwise. Then = Ij~ 
and S~ = '^jlj- (Note that no runner can overtake another more than 
once during [n, n + 1].) We let {x} := x — [xj € [0, 1) denote the fractional 
part of a real number x. (We will also use {} to denote sets; the meaning 
should be clear from the context.) Then 

/+ = 1 <;=^ Pi > pj and < {npj - npi} < Pi - pj, (4.2) 
IJ = 1 <J=^ Pi < Pj and 1 - {pj - Pi) < {npj - npi} < 1. (4.3) 

We calculate the probability of (4.1) by finding the asymptotic joint dis- 
tribution of L and the fractional parts {npj — npi}, j ^ i, where again we 
choose n uniformly at random with 1 < n < A*", and then let A^ oo. By 
the formulas above, this gives the asymptotic joint distribution of , S~ 
and L, and thus the (asymptotic) probability of (4.1). 
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We say that an infinite sequence {vn)n>i € [0, 1)*" ^ x {0, ... ,m — 1} is 
uniformly distributed if the empirical distributions N^'^ S^=i ^v„ converge 
to the uniform distribution as — > oo, where 6v„ denotes the Dirac measure. 
This means that if A C [0, 1)"*-^ with X{dA) = and A; G {0, . . . , m - 1}, 
then #{n < N : Vn & A x {k}}/N ^ \(A)/m. (Here A is the usual 
Lebesgue measure.) This is a simple extension of the standard notion of 
uniform distribution for a sequence in [0, We claim the following. 
(For notational convenience, we state the case i = 1 only.) 

Lemma 4.1. Suppose that pi, ■ ■ ■ ,Pm ire linearly independent over Q, and 
let Ln S {0, . . . , m — 1} be the position of runner 1 relative to the flag at time 
n. Then the sequence of vectors f„, = {{n{p2 — Pi)}, ■ ■ . , {n{pm — Pi)}, -^n), 
n> 1, is uniformly distributed on [0, l)"*"-*^ x {0, . . . ,m — 1}. 

We postpone the proof and first complete the proof of (1.2). 

By (4.2) and (4.3), for each j, at most one of Ij' and /~ is non-zero, 
depending on whether pj < pi or pj > pi. We simplify the notation by 
letting Ij = /+ + IT; thus 5+ = Ej:p,<p, Ij and = 'Ej-.p^yp, Ij- 

For a given A'^, these are random variables, and we have, letting P^r denote 
the probability when n is uniformly chosen with n < N, 

1 

Fn{S+ - + 1< -L) = Y^ Fn{L = £ and 5- - 5+ - 1 > £). (4.4) 

As N ^ oo, Lemma 4.1 and (4.2)-(4.3) show that the distribution of L = Ln 
converges to the uniform distribution on {0, . . . , m — 1} and the distribution 
of Ij converges to Be(|pi — Pj\) for all j 7^ i; moreover, these are asymptoti- 
cally independent. Hence, (4.4) yields 

m—l 

Fn{s+ -s- + i<-l)^Y,- f{s- -s+-i>e) 

£=0 ^ 

= -E(5r-5+-l),, 

where 5+ = E,.p,<pjf and 5" = Ej:p,>pjf with if ~ Be{\p. - p,\) 
independent. This is the result stated in (1.2). 

We proceed to show Lemma 4.1. First recall a well-known result by Weyl. 
(The standard proof is by showing that the Fourier transform (characteristic 
function) ;^ X]^=i exp(27ri ^^^^^ nj{nyj}) 0, as N ^ 00, for any fixed 
integers ni, . . . ,nk, not all 0, see for example [4, Exercises 3.4.2-3].) 

Lemma 4.2 (Weyl). Suppose that yi,...,yk and 1 are linearly indepen- 
dent over Q. Then the sequence of vectors ({nyi}, . . . , {nj/fc}) S [0,1)^^ is 
uniformly distributed in [0, 1)^. □ 

We will need the following extension. Let Modm(x) = m{x/m}; this 
is the remainder when x is divided by m. Thus, if r is an integer, then 
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Modm(r) is the unique integer in {0, . . . , m — 1} such that Modm{r) = r 
(mod m). 

Lemma 4.3. Suppose that yi,...,yk and 1 are linearly independent over 
Q. Let £n = Modm((X]j=i b^Vjl ) ~ ''^) ^ {0, . . . , m — 1}. Then the sequence 
of vectors ({nyi}, . . . , {nyfc},£„) G [0, 1)*^ x {0, . . . , m — 1} is uniformly dis- 
tributed in [0, 1)'^ X {0, . . . , m — 1}. 

Proof. Let = yj/m and Wn = {wni, ■ ■ ■ ,Wnk) with Wnj = {nzj}, j = 
1, . . . ,k. Then zi, . . . ,Zk and 1 are hnearly independent over Q, and thus 
Lemma 4.2 (Weyl's theorem) shows that the sequence {wn)n>i is uniformly 
distributed in [0, 1)''". Further, 

nyj — mWnj = mnzj — m{nzj} = m\nzj\ = (mod m). 

Hence, 

{nyj} = {mWnj} and ["-^jJ = Y^'^nj\ (mod m). (4-5) 

Thus, Modm{[nyj\) = [mwnj\- 

Let £„j = Modm,( L'^yjJ ) = If a sequence (n„) is uniformly dis- 

tributed in [0,1), then (m,n„) is uniformly distributed in [0, m) and the 
vectors {{mun}, [mn„J) are uniformly distributed in [0, 1) x {0, . . . , m — 1}. 
Using this argument in each coordinate, the fact that {wn) is uniformly 
distributed in [0, 1)'^ implies that the sequence of vectors {{nyj},inj)'j=i ~ 
{{mWnj}, \jnwnj\)^j=i uniformly distributed in [0, l)'^ x {0, . . . ,m — l}'^. 

Let In = y^odm{J2]=i L^yjJ) • Then = Mod^d^^^^i Inj) , and it follows 
that the sequence of vectors ({nyi}, . . . , {nyk},ln) is uniformly distributed 
in [0,1)^ X {0,... ,m- 1}. 

This is almost what we claim. To complete the proof, we consider a 
subsequence of the form n = mv + no, f > 1. Weyl's theorem holds 
for such subsequences too, as a consequence of the version in Lemma 4.2 
since {{mi/ + no)pj} = {^{mpj) + n^pj} where mpi, . . . , mpk and 1 are lin- 
early independent over Q, and the constant shift by n^pj does not affect 
the uniform distribution. Consequently, the argument above shows that 
({nyi}, . . . , {nyfc}, ^„) is uniformly distributed for each such subsequence. 
But along the subsequence, ^„ = Modm(^n — n^), so ({nyi}, . . . , {nyk},tn) 
is uniformly distributed for each such subsequence, and thus for the entire 
sequence. □ 

Proof of Lemma 4-L Suppose that runner 1 carries the flag at time n, i.e., 
Ln = 0. Then state 1 gets an additional seat, i.e., its number of seats is 
rounded up to [npi] , and state j gets 

\npj — {npi}] = \npj — npi~\ + [npi\ = [npj — npi\ + \npi\ 
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seats. Since the total number of seats is n, we have, stih in the case L„ = 0, 

m m 

n = [iipj — npi\ + m\npi \ = [npj — npi\ (mod m). 
i=2 j=2 

In general, there are L„ additional states whose numbers of seats are rounded 
up (or —Ln fewer, if L„ < 0); thus 

m 

n = Ln + [npj — npi\ (mod m) 

and 

m 

Ln = n — ['n{pj — pi)\ (mod m). (4-6) 
i=2 

Let yj = pj — pi, j = 2, . . . ,171. Since pi, ■ ■ ■ ,Pm are linearly independent 
over Q, it is easily seen that y2,---,ym and YIT Pj ~ ^ ^^^^ linearly 
independent over Q. (This can be seen as a change of basis, using a non- 
singular integer matrix, in a vector space of dimension m over Q.) Thus 
Lemma 4.1 follows from Lemma 4.3 (with k = m — 1, after renumbering 
2/2, • • • , ?/m), since L„ = -in (mod m) by (4.6). □ 

This completes the proof of (1.2). We proceed to show that this can be 
evaluated as (1.3). 

We may assume that the states are ordered by size, pi > ■ ■ ■ > pm] 
thus Pi = p(^iy Consider the i:th largest state. Let X- C [i — 1] and 
X+ ^ + 1, . . . , m}, where as usual [n] := {1, . . . , n}. We think of X- and 
X+ as the indices j for which /j*^ = 1. Let P=(X_,X_|_) be the probability 
that {j : /j*"* = 1} = X^ U X^. Then clearly, for simplicity writing fj : = 
= \Pi -Pjl 

p=(x_,x+)= n n (i-^i)- 

j&X-UX+ jGH\{X_UX+U{j}) 

The formula (1.2) for the probability that the r.th. largest state suffers 
from the Alabama paradox is thus 

(ix„j-jx+i-i). n n (1-^^-)' 

X-(Z[i-l],X+Q{i+l,...,m} j&X^VjX+ j£[m\\{X-\jX+\j{i}) 

where the sum runs over all pairs of subsets such that \X- \ > \X^ \ + 2. 

Now, for any s > 0, A; > 2 and any monomial n^=i ' n^'=i where 
1 < ji < ■ ■ ■ < jk 1^ i — ^ and i + 1 < /i < • • • < < jn, we get the coefficient 

h=2 u=0 V / V / 
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Now, we only need to prove that A[k,s) = ('''''^~^)- 

For notational convenience we will prove this statement for k+1. We start 
by splitting the first binomial coefficient, and then substituting /i — )■ /i + 1 
in the second part. 

fe+l / /, \ / , \ \ h-2 



A,. + ,„) . B-D' (C) - (, ! 0) E(-ir(:) (" — 1) 

h=2 V V / v / / u=o ^ ^ 



h=\ ^ ^ u=0 



h=\ 

h~2 



h=2 ^ ' 11=0 ^ ' h=l ^ ^ ^ ^ 



Using Lemma 4.4 below on the first sum and the Vandermonde convolution 
ELo ( ■) {^^) = on the last we get 

An , , ^ /s + k-l\ /s + k\ (s + k-l 

The formula (1.3) follows from the following lemma. 
Lemma 4.4. For any integers k > 2, s > we have 

i:MK:)|Mr(:)^c::r) 

Proof. First note that by standard binomial identities we have 

We may now use the Vandermonde convolution to get, with j = h — 2, 

E(-'K:)E(-r(:)^EGJ_,)C-OK'rr)- 

h=2 ^ ^ «=o ^ ^ j=0 ^ J/ \ J / \ / 

□ 



This completes the proof of Theorem 1.3. 
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5. Proofs of corollaries 

Proof of Corollary 2.1. When m = 3, the double sum in (1.3) is non-empty 
only if i = 3; in this case there is a single term with s = and k = 2 and 
the result follows immediately. 

Alternatively, we can use (1.2): — Sf — 1 > is possible only with 
S~ = 2 and Sf = 0; this requires that at least 2 states are larger than state 
i so i = 3, and in this case the probability is 



^P(53" - = 2) = ip(/f) = if = 1) = ^(P(i) -P(3))(P(2) -P(3)). □ 



Proof of Corollary 2.3. The fact that q(jn) > ^(m-i) ^ • • • follows from (1.2) 

and a simple coupling argument. Furthermore, if i < 2, then < 1 and 

Q{i) = 0. 

For i = m, (1.3) simplifies to 

m—1 

•mq{rn) = ^{-'^)''ek{{p(l) - P(m) ),•••, (P(m-l) - P{m))) 
k=2 

m—1 m — 1 

= n - -Pim))) + Y.iPij) -P(m)) - 1 

3=1 i=i 

m—1 

= n ~ (PO) -P{m))) - mp^m), (5.1) 

which is (2.1). 

Furthermore, it follows from (2.1) (or (1.2)) that g^^) will increase if we 
decrease p(^) to and simultaneously increase say. For = 0, 

the product in (5.1) is largest when all pj for j < m are equal, i.e. = 
l/(m — 1) for j < m; in this case (5.1) yields (1 — l/(m — 1))™""^. Hence, 
for any p(^,^-^ > and any i, 

1 / 1 1-1 / ^ 

9(i)<9(m)<— 1 7 < — e \ 5.2 

Finally, for any i, 

H^r-^t) = {Pj-Pi)- X] (Pi-Pj) = ^{Pi-Pi) = '^-mpi. (5.3) 

Pj>Pi Pi<Pi j 

Consequently, using (1.2), 

mg. = E{Sr _ 5+ - l)^ > E ((5r - l)_^ - 

= E(5--l + l{Sr=0})-E5+ 
= E5- - 1 + P(5r = 0) - E5+ 

= P(5r =0)-mpi= {Pj - P^)) - mp^. (5.4) 

Pj >Pi 
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If X G [0, 1], then 1 — x > e~^' — ^x^, and it fohows that 

n (1 - -Pd) > n ^'^"''"'^ -\n -p^f ^ ^"^^'^ - \ y.pi 
Pj>pi Pj>pi pj>Pi j 

which by (5.4) yields the lower bound in (2.2). □ 
Proof of Corollary 2.4- The expected number is qj, and (2.2) shows that 

In the special case, (2.2) shows that Qi ^ e ^ jm for each small state, and 
thus Ylij ~ □ 

Proof of Corollary 2.5. It is well-known and easy to see that if / ~ Be(r) and 
Y ~ Po(r), then / and Y may be coupled with E |/— 1"| = 2(e~'' — l+r) < r^. 
Couple in this way ij*^ with Yj ~ Po(|pj —Pj\), with the latter independent, 

and define 5+ = T.r.p,<pJi ~ Po(A+) and 5" = T.r.p,>pJ^ ~ 
Then, by (1.2), (2.3) and the triangle inequality, 



m\qi -qi\< ^S- - Sf - (5" - ^+)| 



j:pj>Pi j-Pj<Pi 

^2 



- E iP3-Pif+ E ^Pi~Pi' 
j.Pj>Pi j-Pj<Pi 

This proves (2.3), which immediately yields (2.4). To obtain (2.5) we observe 
that the sum is E(5+ + 1 - S')^ = -E(5" - S+ - l)_, and thus the 
difference between the sums in (2.4) and (2.5) equals 

E(^" - 5+ - 1)^ + E(5" - 5+ - 1)_ = E(S" - 5+ - 1) 



E ^Pi - Pi)- E ^P' ~ P^) ~ ^ = ^(Pj -Pi)-'^ 

i-Pj>Pi j-Pj<Pi j=i 

1 — mpi — 1 = —mpi. □ 



6. Proofs of results on average probabilities 

We let := {(xi,...,x^) G [0, cx))"* : Zi^i = a} and A^^ := 

{{xi, . . . ,Xm) G [0,00)™ : — When integrating over A^, we 

use the measure dx := dxi • • • dx^-i; this is thus the same as integrating 
over A^-i with Lebes gue measure, keeping Xm = 1 — Xli" ^i- Note that 
the volume of AJ^ equals the volume of A!J^~^, i.e. a"^~^/{m — 1)1. Hence, 
the uniform probability measure on A"* = A™ is (m — 1)! dx. 

More generally, we have the well-known Dirichlet integral 

x--^ • • • x<^-~' dx = a"^+-+---iII|iiM (6.1) 
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for any ai, . . . , Om > 0. (For m = 2 this is the standard Beta integral and 
the general case follows easily by induction. An alternative, probabilistic, 
standard proof is to let Ti,...,Tm be independent Exp(l) variables and 
evaluate E(rf ^"^ • • • T^™"^) by conditioning on Ti + • • • + T^.) 

Proof of Theorem 3.1. Recall from Corollary 2.3 that if we assume pm < Pi 
for all i and let := pi - pm, then = ^ 11^111^(1 " "^0 " Pm- Choosing 
a vector (pi, . . . ,Pm) uniformly from A™, there are m possibilities for the 
position of the minimum coordinate; by symmetry it suffices to consider the 
case when pm is the minimum (multiplying below by a factor m). Then the 



vector (ri, . . . , r^-i) is uniformly distributed in A'^^^^^^^ and dx = dpm dr. 



, m—l 

. ,rm—l) IS umiormiy uisiiiuuieu m z. 

We thus get 

-1/m '■ / - m-l 



|■^/rn r / 1 \ 

) =m{m -1)1 / — TT (1 - r-j) - dr dpm- 

(6.2) 

We treat the two terms in the bracket separately. First, using symmetry 
and (6.1), 

f-l/m p m—l 

(m-1)! / / TT (1 - r,) dr dp„ 

^ l-mpm t=l 

m—l / ^\ /•l/m f ^ 



k=0 
m—l 



^\k ("l-l)! 



^ V ^ / "ir(/c + m) 
^t^Q \ k J m'' 



Similarly, the second term becomes 

-1/m f /•l/m 

m{m — 1)! 



/ / Pmdr dpm = m{m - 1) / (1 - mpm)"^~'^Pm dpr, 

Jo ^reAr4,n 



m In m? 



(6.4) 

The formula (3.1) follows from (6.3) and (6.4), noting that the first two 
terms in the first sum equal 1 — ^^^^^ = which cancels the term — 1/m^. 

The asymptotic expansion (3.2) is easy to deduce from (3.1). (One can 
also easily obtain further terms.) 
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Since < /m a.s., we have E \e~^ — = E(e~^ — mq(^^-^) = 

— mE(7(^) — > 0, and thus mq(^^-^ e~^. □ 

Proof of Theorem 3.3. Take i = m. We take expectations in (1.4) and ob- 
tain by symmetry 

-...4giv.)-(-:-)G,,,„"-._,) 

■IE rQ(Pm-?'j)+n(^'^+'~^'™)+ • f^^-^) 

Vi=i 1=1 J 

We use the standard method of generating uniform pi, . . . , pm on A™ by 
taking i.i.d. exponential random variables Ti, . . . ,Tm ~ Exp(l) and letting 
Pi := Ti/Sm with Sm ■= Z^IILi Ti. Recall that then (pi, . . . ,Pm) and 5^, are 
independent, and that 5m has a Gamma(m, 1) distribution with ES^ = 
r(m + a)/r(m) for a G N. Hence, by conditioning on T — Tiji and using 
independence. 



(s k 
Y[iPm - Pj)+ Y[iPs+l - Pm) 
j=l 1=1 

E (nu(Trn - T,)+ ntiiTs+i - 



i+k 



^^""^ E((E((r-ri)+ I r))^(E((r2-r)+ I T))') . (6.6) 



{t-x)e-''dx = e-^ -l + t, (6.7) 



r(m + k + s) 
For any t > and j > 1 we have 

E(t-T,)+= r 

Jo 

/oo 
(x - i)e-^ dx = e"*. 

Hence, the final expectation in (6.6) equals 

E ( (e-^ - 1 + r) '^e-'^^) = (e-* - I + t) '^e'^^+i)* dt 



r T ( ' )e-''t^-iy-'-^e-'^'+'^'dt 
t^' \hJ,s-t-jJJo 



i+j< 

yi, j,s — i— jj{i + k + 



i+j<s 
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The result now follows from (6.5)-(6.9). □ 

Proof of Theorem 3.5. Note first that the expectation in (3.4) can be eval- 
uated as in (3.5)-(3.6). For (3.5) this is immediate; for (3.6) it follows from 

E(5" - 5+ - 1)^ - E(5- -S^ -!)_= E(5~ - 5^+ - l) = A" - A+ - 1. 

We let, as in the proof of Theorem 3.3, pi := Ti/Sm with Tj ~ Exp(l) 
i.i.d. and Sm '■= Yl^i'^i- Consider state 1 and condition on Ti, leaving 
T2,T3,... i.i.d. Exp(l). We define := Y^jiPi — Pj)± as in Corollary 2.5 
(with i = 1). As m oo, the law of large numbers shows that a.s., using 
(6.7)-(6.8), 

^ = Zi + yZl^0 + Er2 = l, (6.10) 

i=2 



s s ^ 

J=2 J=2 



^ E((ri - T2)+ I Ti) = e-^^ - 1 + Ti, (6.11) 

^ ^ (ri-r,)_ m^ (Ti-T,)„ 

„ Srr} Sry) „ TTli 

j=2 ™ j=2 

-^E{{n-T2)-\Ti) =e-^\ (6.12) 

First, by (6.10), 

TJli 

mpi = —Ti^Ti. (6.14) 

Next, we apply Corollary 2.5, noting that qi = ^{X~ , X^)/m. Hence, 
by (2.6) and (6.13), mqi — <1>(A^,A''") a.s. For any Ai,A2 > 0, we can 
couple Si ~ Po(Ai) and S2 ~ Po(A2) such that E |5i — < |Ai — A2I, and it 
follows from (3.4) and the triangle inequality that |$(Aj', A^j'') — <^>(A^, A^)| < 
\Xi — A2^| + |A^ — A^l for any , Xf , X2 , A^. Hence (6.11)-(6.12) imply that 
<1>(A^,A^) — )• ^'(Ti) a.s. Consequently, a.s. 

mqi ^{Ti). (6.15) 

The limit (3.7) follows from (6.14)-(6.15). Since mqi < e~^ a.s. by Corol- 
lary 2.3, (3.8) follows by dominated convergence toghether with (3.5). □ 

7. Further comments 

7.1. Several states at once. Several states may suffer from the Alabama 
paradox at the same time. A simple example is given in Figure 3. 

This can be analysed in the same way using the methods in Section 4. 
The typical case is when two states i and j are in positions and 1, and both 
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state 


pop. 




seats 
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pop. 




seats 


A 


28 


1.40 


1 




A 


28 


1.68 
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27 


1.35 
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1.62 
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27 


1.35 
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27 


1.62 
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D 


9 


0.45 
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9 


0.54 





E 


9 


0.45 


1 




E 


9 


0.54 





sum 


100 


5.00 


5 




sum 


100 


6.00 


6 



5 seats 6 seats 



Figure 3. A double Alabama paradox for states D and E. 
Numbers in boldface are rounded up. 

are overtaken by three other runners. However, other similar configurations 
are possible, and we leave the details to the reader except for two simple 
examples. 

Example 7.1. Suppose that there are 5 states, with pi > P2 > Ps > Pa > P5 
and (pi) linearly independent over Q. If a double Alabama paradox occurs, 
it has to be for states 4 and 5, with one of them in position 0, the other in 
position 1, and the three others overtaking both of them. Letting x be the 
distance between the runners 4 and 5, we find the probability, using uniform 
distribution as before, 

P3-P4 

(P3 -P4- X){p2 -pA- x){pi -P4-X) dx 

I rPi-P5 

+ T I (Ps - Pi){P2 - Pi){pi - Pi) dx 
1 rp:i-P5 

+ T (PS - P5 - X){p2 -P5- X){pi -P5-X) dx. 

Jp4-P5 

The integrals are easily evaluated (preferably by computer), but the resulting 
polynomial in pi, . . . ,p5 does not look particularly nice or illuminating so 
we omit it. In the extreme case pi ~ P2 ~ ~ 1/3, P4 ~ Ps ~ 0, the 
probability becomes 1/810. 

Example 7.2. Consider the extreme case in Corollary 2.4, with m — o{m) 
small states and o(m) medium-size states with the bulk of the population. 
In this case, the flag is most likely carried by a small state, say i. The 
probability that it is overtaken by a large state j is pj —pi^ Pj , and we can 
approximate the distributions of the number M of states overtaking it by a 
Poisson distribution with mean YljiPj ~ Pi)+ ~ ^jPj ~ ^- Thus i loses a 
seat if M > 1. Furthermore, since there are very many small states, their 
runners are very narrowly spaced, and most likely the runners in positions 
1, . . . , M — 2 are also small states, and are also overtaken by the same M 
states as i; in this case they all lose a seat. 
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0.6 


0.7 



Figure 4. Ties. Numbers in boldface are rounded up; num- 
bers in italics may be rounded up or down, as determined by 
lot. 

Consequently, if the random variable X„i is the number of states suffering 
from the Alabama paradox when n is increased (from a random value), then 

as m — 7- oo (in this case), Xm — — > X := (Y — 1)+ with Y ~ Po(l). Note 
that F{X > 0) = P(y > 1) = 1 - 2e~i and EX = E{Y - 1 + 1{Y = 0}) 
= F{Y = 0) = e-i. 

7.2. The linearly dependent case. The proof above uses Weyl's theorem 
and thus requires that pi, . . . ,Pm are linearly independent over Q; indeed, 
as said in Section 1, simple examples shows that Theorem 1.3 does not hold 
for arbitrary pi. We note also that some of the corollaries might be far 
from true in the case of rational relative sizes, see Proposition 7.4 below. 
Nevertheless, it is possible to use much of the argument above also for the 
linearly dependent case. We sketch this below, leaving many details to the 
reader. 

Note first that in general it may happen that the remainders //j — [/ijj 
happen to be equal for two or more states, and it may be necessary to round 
up one or several of these and round down the others; in this case, the choice 
is determined by lot. A simple example is given in Figure 4; note that state C 
may suffer the Alabama paradox either when increasing from 4 to 5 seats or 
from 5 to 6, but not at both times; the probability is 1/4 each time. For the 
asymptotic analysis this complication is no real problem, however, since it 
suffices to consider the expectation E Ui{N), i.e., the sum over n < N the 
probability of an Alabama paradox at time n. Indeed, although the example 
in Figure 4 shows that there may be a dependency between the occurrence 
of the Alabama paradox for some number n of seats and the next number 
n + 1, more distant occurrences are independent; i.e., the random sequence 
of occurrences of the paradox is 1-dependent. Hence the variance of the 
total number is 0{N), and by considering odd and even n separately, which 
yields two subsequences of independent random indicators, it is easily seen 
that (i^iiN) -Eui{N))/N almost surely. 

Remark 7.3. If two or several states have exactly the same population, it 
may be necessary to draw lots between them for both n seats and n + 1. If 
we do this independently, then some state might lose a seat by being unlucky 
the second time. This obvious consequence of drawing lots is not an example 
of the Alabama paradox and should be disregarded. (For example, we may 
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list the states with the same population in some random order, once and for 
all, and use this as a priority list for each n.) 

Let p := (pi, . . . ,Pm)- We now use qi for the true probability and iji 
for the value in (1.2) for any pi, . . . ,pm- Consider again, for notational 
convenience, state 1. The argument in Section 4 shows that, with L„ given 
by (4.6) (noting that the actual position may be different when there are 
ties), for some functions /, g and h and with zj = {pj —pi)/m, for simplicity 
assuming that is a multiple of m, 

N-l 

El^liN) = Y,f{MP2-pi)},...,MPm-pi)},Ln) 

n=0 
N-l 

= X] 9{{nz2}, {nz^m}, Modm(n)) . 

n=0 
N/m 

= ^ mh{{k{p2 - pi)}, {k{pni - pi)}) . (7.1) 
k=l 

(To see the last equality, write n = [k — l)m + / and define /i(x2, . . . Xm) '■= 
Z]/=o"^ 9i{x2 + (^ - m)z2}, . . . , {xm + (l - 'm)zm}, l)/m.) Lemma 4.2 does 
not apply when pi,...,pm are linearly dependent over Q, but the proof 
of it sketched above shows that the sequence [{n{p2 — Pi)}, • • • , {n-{pm — 
Pi)}) is uniformly distributed on a subgroup of [0,1)™"^; more precisely, 
the empirical distributions converge to the uniform probability measure /ip 
on this subgroup, which has Fourier coefficients given by 

71 (n n ) - 1^ Ej"L2«ife - Pi) e . . 

fj.p[a2, ■ ■ ■ ,am.) - < ,1 ■ ('-^i 
I U otherwise. 

The functions /, g and h are linear combinations of products of indicators 
and are a.e. continuous; moreover, h is ;Up-a.e. continuous. Hence, (7.1) 
shows that 'Ei'i{N)/N f hdfip, showing the existence of the limit qi = 
f hd^p in general. Note that in this notation, the value qi in (1.2) is 
qi = f hdfj,, where ji is the uniform distribution on [0, 1)™~^. 

The functions /, g and h depend on p, cf. (4.2)-(4.3), but if we write 
hp, then hp^ — > hp a.e. for any sequence of population distributions (for a 
fixed number of states) pi, P2, . . . , with p^ — )• p. Let p^ = {pik, ■ ■ ■ ,Pm.k)- 
If further, for every integer vector (ai, . . . , a^) ^ 0, we have J^^i o-iPik 7^ 
for all large fc, then it follows from (7.2) that /ip^(a2, . . . ,am) — ^ for all 
(02, . . . , am) 7^ 0, and thus /Xp^. — > /i = /ip. It follows, e.g. by [2, Theorem 
5.5] that qi^k = / hp^ d/ip^ ~^ J d/Up and also / hp^ d/x — > j hp d/ip so 
Qi,k — J hpf. dfi 0. The claim in Remark 1.2 now follows, since otherwise 
one could, for some e > 0, find such a sequence pk of distributions with 
\Qi,k — J hpf, d/i| > e and (by taking a subsequence) p^ — p for some p; a 
contradiction. 
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We end with a couple of counterexamples in the rational case. The upper 
bound qi < ;^e~^ in Corollary 2.3 is not true in general, not even for m = 3. 
As an easy example one may study three states with pi = P2 = 3/7 and 
P3 = 1/7, where = 1/7 > since the smallest state suffers when n 
increases from 3 to 4 (mod 7). 

Moreover, the upper bound 1/e on the expected number of states suffering 
from the paradox in Corollary 2.4 is not true in general. Consider the case 
when pi = ■ ■ ■ = pq = 1/7 and P7 = ■ ■ ■ = pioe = 1/700. When e.g. the 
number of seats changes from 42 to 43 all the 6 large states will change 
from 6 to 7 seats. Thus 5 of the small states will suffer from the Alabama 
paradox. The paradox will happen 98 times during the period of length 700; 
90 of these 5 small states will suffer, but the number is smaller close to the 
beginning and end of the period (n = 7, 14, 21, 28 or 671, 678, 685, 692 
(mod 700)) and the expected number of states to suffer from the Alabama 
paradox for a random number of seats is 

90 ■ 5 + 2 ■ (1 + 2 + 3 + 4) _ 47 
700 ~ 70' 

which is much larger than 1/e. This can be generalized in the following way. 

Proposition 7.4. Let x > 2 and y be positive integers, with x relatively 
prime to y — 1 and x^ — 3x < y. Assume that the number of states is 
m = y + X — 1 with relative sizes pi = ■ ■ ■ = Px-i = ^ md Px = ' ' ' = Pm = 
Then the expected number of states suffering from the Alabama paradox 
equals 

{x-2){y-x + l) ^ (7.3) 
xy 

which in particular can be made arbitrarily close to 1. 

Proof. The values of n (mod xy) where the Alabama paradox might occur 
are, e.g. by considering runners as in Section 4, ax + b, for integers 1 < a < 
y — 2 and < 6 < x — 1 such that ax + b > by and ax + 6 + 1 < (6 + l)y, 
i.e. b = \ax/{y — 1)J. There are y — 2 such values ax + b, one for each 
ae {l,...,2/-2}. 

Ifx — 2<a<y — x + 1, then for n = ax + b, with b as above, the x — 1 
large states will get a seats each and the remaining a + b seats will go to a + 6 
of the small states. For n = ax + b + 1, the larger states will get a + 1 seats 
each and only a + 6 — x + 2 seats are left to the small states; thus x — 2 small 
states will suffer from the Alabama paradox. If 1 < a < x — 3, then 6 = 0, 
and the number of small states receiving a seat will drop from a + 6 = a to 
as n increases from ax + b to ax + b + I. Finally, ify — x + 2<a<y — 2, 
then 6 = X — 1 and the number will drop from y to a + b — x + 2 = a+l, 
so y — 1 — a states will suffer. Summing these numbers give a total of 
(y — 2x + 4)(x — 2) + 2 Yli=i ^ = (y — x + l)(x — 2) states suffering in the 
period xy. 
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Taking e.g. y = and letting a; — )• oo, the expected number (7.3) tends 
to 1. □ 

We thank Warren D. Smith for asking a question that made us produce 
Proposition 7.4. It would be interesting to see a proof of 1 being a general 
upper bound or an example to the contrary. 
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